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ON THE SYLVESTER MATRIX EQUATION OVER 
QUATERNIONS 

VLADIMIR BOLOTNIKOV 


Abstract. The Sylvester equation AX — XB = C is considered in the setting of 
quaternion matrices. Conditions that are necessary and sufficient for the existence 
of a unique solution are well-known. We study the complementary case where the 
equation either has infinitely many solutions or does not have solutions at all. 
Special attention is given to the case where A and B are respectively, lower and 
upper triangular two-diagonal matrices (in particular, if A and B are Jordan 
blocks). 


1. Introduction 

We start with the complex case: given complex matrices A G B G 

C G the Sylvester equation 

AX-XB = C (1.1) 

has a unique solution X = [xij] G if and only if the spectrums (j{A) and (t{B) 

are disjoint. The result was established by Sylvester m via representing (HI) in 
the equivalent form Gx = c where G = A ® — In ® B^ and where x, c are the 

columns constructed from the entries of X and C by 

X — Coli<2<7x (Coli<^<77iX2j) , C — Coli<2<7T, (Coli<2<raQj) • (^•^) 

Thus, the equation (|l.ip has a unique solution (for any C G if and only if 

the matrix G is invertible (equivalently, (t{A) ncT(R) = 0), in which case the unique 
solution X is recovered from the column The infinite-dimensional extension 

of the Sylvester’s theorem as well as the integral formula for the unique solution X 
of (jl.ip is due to Rosenblum |14) . Some other explicit formulas for X G (in 

terms of A, B and G rather than their entries) can be found in the survey [11]. If 
a{A)ria{B) ^ 0, then (ll.ip has a solution if and only if rankG = rank [G c] , while 
the homogeneous equation AX = XB has d linearly independent solutions where 
the integer d = dimNul(G) can be expressed in terms of invariant factors of matrix 
pencils A and B mm)- 

Let IIn{oi) denote the n x n lower triangular Jordan block with a on the main 
diagonal: 

Jn{a) = aln + Fn, where Fn = [5i^j+i]Xj=i (1-3) 

and where is the Kronecker symbol. If A and B are Jordan blocks A = Jn{oi) 
and B = j7)X(/3), the equation (jl.ll) simplifies to 

{a- 13)X = G - FnX + XF^. 
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If a 7 ^ /3, we iterate the latter equality n + m — 1 times and arrive at 

n+m-2 . . 

X= 2 ^ (-!)*+'( jF‘C(Fjr. (1.5) 

k=^ i-\-j=k ^ ^ 

The formula (jl.5l) appeared in [13] and in a slightly different form, in [T5|. If a = /? 
and n > m (this case was considered in [l3|), the equation (ll.4p can be written 
entry-wise as follows: 

0 = ciq, Xjq = Cj+iq, xij = —cij+i, 

Xjj+i - Xi+ij = Cj+ij+i for 1 < i < n; 1 < j < m. (1.6) 

Then one can see that if the system (|1.6I1 has a solution, then necessarily 

k-l 

Cj^k-i = 0 for k = 2,...,n. (1.7) 

i=l 

On the other hand if conditions (11.71) are met, then for any choice of fixed Xn,i, • ■ ■, Xn,m, 
the system (|1.5I) has a unique solution. In other words, the bottom row in X serves 
as a free and independent parameter in the parametrization of all solutions X of the 
Sylvester equation (11.411 . 

In general, one may reduce given matrices A and B to their lower and upper 

Jordan forms (with say, k and i Jordan cells, respectively) subsequently splitting 

the equation (II.ip to ki Sylvester equations of the form (II.4p . 

In this paper, we will focus on the equation (II.ip over quaternions: 

AX-XB = C, where A G B € C G . (1.8) 

Since multiplication in IH is non-commutative, the equation (jl.Sp is not completely 
trivial even in the scalar case. The following result back to Hamilton (see e.g., |181 
p. 123]). 

Theorem 1.1. Given a,l3,c G H, the equation ax — xf3 = c has a unique solution 
if and only if 

Pa,13 | q :|^ — {a -\- (P)ld + /3^ 7^ 0 (1-9) 

and this unique solution equals x = {ac — cl3)P~p. 

Taking the advantage of complex representations for quaternion matrices [I2|,it 
is always possible to reduce (|1.8p to certain complex Sylvester equation producing 
in particular, the uniqueness criterion: the equation (II.8p has a unique solution if 
and only if the right spectrums of A and B are disjoint, see [Hi |T6] and Theorem 13.11 
below. However, further results obtained on this way and briefly surveyed in Section 
3 refer, to some extent, to complex representations of matrices A and B rather than 
the matrices themselves; see e.g., formula (j3.3p for the unique solution. 

Our contribution here are several explicit formulas for the unique solution given 
exclusively in terms of the original matrices A, B, C in the cases where (1) A and B 
are Jordan blocks (Theorem l3. 21 presents the quaternion analog of formula (jl.5p ). (2) 

A is two-diagonal (Theorems 13.31 and 13.4p . and (3) A is lower-triangular (Theorem 
13.8h . Making use of canonical Jordan forms for A and B (see [20] and Theorem 12.11 
below) one can reduce the general case to the one where A and B are Jordan cells. 
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The core of the paper is the study of the singular case (the right spectrums of 
A and B are not disjoint). Special attention is given to the case where A and B 
are two-diagonal matrices (see formulas (I2.21[) i. As will be explained in Section 2.4, 
Sylvester equations with A and B of this form arise in the context of polynomial 
interpolation over quaternions and the results on such special Sylvester equations 
are needed to explicitly describe quasi-ideals in the ring of quaternion polynomials. 
In Section 4 we present necessary and sufficient conditions for the equation to have 
a solution (Theorem I4.5jl which become more transparent if A and B are Jordan 
blocks (Theorem 14.71 presents the analog of conditions (jl.7j) from |13j). Also in 
Section 4, we present an algorithm for constructing a solution to the (solvable) 
equation (jl.SI) . In Section 5, we parametrize the solution set of the homogeneous 
equation AX — XB = 0; the parametrization contains min(m, n) free independent 
parameters, each one of which varies in a two-dimensional real subspace of H. In 
case A = In{a) and B = Im{/3) are Jordan blocks, the general solution of the 
homogeneous equation is a “triangular” Hankel matrix all entries of which satisfy 
the homogeneous scalar Sylvester equation ax — = 0 fCorollary 15.2p . 

2. Preliminaries 

In this section we collect basic facts to make presentation self-contained. We 
hrst fix notation and terminology. By HI we denote the skew field of quaternions 
a = xq + ixi +]X 2 + kx 3 where xq, xi, X 2 , X 3 G M and where i, j, k are the imaginary 
units commuting with R and satisfying = k^ = ijk = —1. For a G HI 

as above, its real and imaginary parts, the quaternion conjugate and the absolute 
value are defined as Re(a) = xq, Im(Q;) = ixi J-jx 2 J- kx 3 , a = Re(a) — Im(a) 
and lap = aa = |Re(a)p + |Im(a)p, respectively. Two quaternions a and /? are 
called equivalent (conjugate to each other) if a = h~^/3h for some nonzero h G H; in 
notation, a ^ j3. It turns out (see e.g., a) that 

a ^ j5 if and only if Re(a) = Re(/3) and |a| = |/3|, (2-1) 

so that the conjugacy class of a given a G HI form a 2-sphere (of radius |Im(a)| 
around Re(a)) which will be denoted by [a]. It is clear that [a] = {a} if and only if 
a G R. 

A finite ordered collection o: = (ai,... ,a„) will be called a spherical chain (of 
length n) if 

ai ~ 02 ~ ... ~ afc and Oj+i 7 ^ a j for j = 1 ,..., n — 1 . ( 2 - 2 ) 

The latter notion is essentially non-commutative: a spherical chain a = (ai,..., a„) 
consisting of commuting elements necessarily belongs to the set {ai,ai} which to¬ 
gether with inequality in ( 12 . 2 h implies that all elements in a are the same: 

Q = (a, a,... , a), a G HI. (2.3) 

2.1. Quaternion matrices. We denote by HI"'^™ the space of n x m matrices with 
quaternion entries. The definitions of the transpose matrix AJ, the quaternion- 
conjugate matrix A and the adjoint matrix A* are the same as in the complex case. 

An element a G HI is called a (right) eigenvalue of the matrix A G if 

Ax = xa for some nonzero x G In this case, for any (5 = h~^ah ~ a we 
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also have Ax.h = xhh~^ah = x/i/3 and hence, any element in the conjugacy class 
[a] is a right eigenvalue of A. Therefore, the right spectrum Ur{A) is the union of 
disjoint conjugacy classes (some of which may be real singletons). The studies of 
right eigenvalues and canonical forms for quaternion matrices were carried out in 
[11|121[20]. In particular, it was shown in m that any square quaternion matrix is 
similar to a complex matrix in Jordan form. 


Theorem 2.1. For every A G there is an invertible S G such that 

k 

S~^AS = where a* G C and Imoj > 0 for i = 1,..., fc. 

i=l 

Making use of Theorem 12.11 one can reduce A and B to their lower and upper 
Jordan forms 

k i 

s-^AS = ^JnM), r-isr = 0j-T^.) 


i=l 


i=i 


and conformally decompose S ^CT = [Cij] and S ^XT = [Xij] to see that (II.8p 
splits into M Sylvester equations 

(2.4) 


ffni{oii)Xij XijfJ^Xfij) — Cij. 


This reduction suggests to study the “basic” case where A and B are Jordan blocks, 
i.e., to get the quaternion analogs of the formula (|1.5p and conditions ()1.7p . The 
defhciency of this approach is that most of explicit formulas will rely on similarity 
matrices S and T. 


2.2. Complex representations. Since each a G HI admits a unique representation 
of the form a = oi + a 2 j with 0 : 1,02 € C, any matrix A G can be written 

uniquely as ^4 = + A 2 j with ^ 1 , ^42 G The map 


A = Ai+ A2j t-S' ip{A) 


—A2 Ai 


(2.5) 


that associates to each quaternion matrix its complex representation was introduced 
in [12]. It is additive and multiplicative in the sense that 


ip{A + B) = (p{A) + ip{B) and ip{AB) = (p{A)ip{B) (2.6) 


for rectangular quaternion matrices of appropriate sizes. It was shown in [T2| that 
if A G C is an eigenvalue of 4>{A), then any element from the 2-sphere [A] C H is a 
right eigenvalue of A and that all right eigenvalues of A arise in this way (this fact 
implies that the right spectrum of an n x n matrix is the union of at most n disjoint 
conjugacy classes). Therefore, for matrices A G and B G the following 

conditions are equivalent: 


a{(p{A)) D a{(f{B)) = ^ 4=^ ar{A) D ar{B) = $. 

Observing that ]D = D] for any complex matrix D we define the map 


Ip: Y = 


Yu Yu 
Y 21 Y 22 




Y 


dm 

id'll 


Yu + Y 22 , Ti 2 - T 21 . 
-+ —^—J 


(2.7) 

( 2 . 8 ) 


2 


2 
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assigning to each matrix Y G (£^nx 2 m ^ quaternion matrix constructed from the 
blocks Yij G It is easily verified that iIj is the left inverse of (/>: 

iIj{<P{A)) = A for any A G (2.9) 

Besides, ^|J is additive and, although not multiplicative, the equalities 
i^{<j2{A)Y) =i2{^{A))i2{Y) = Ai2{Y), 

= ^{Y)B ^ ^ 

hold for any A G B G MP^i and Y G 

2.3. Quaternion polynomials. Let BI[ 2 ;] denote the ring of polynomials in one for¬ 
mal variable z which commutes with quaternionic coefficients. The ring operations 
in ]HI[z] are defined as in the commutative case, but as multiplication in H is not 
commutative, multiplication in ]HI[z] is not commutative as well. For any a G H, we 
define left and right evaluation of / at a by 

k k k 

= and ^if f{z) = '^z^fj. (2.11) 

j=0 j=0 j=0 

The formulas make sense for matrix valued polynomials / G and extend to 

square matrices by letting 

k k 

f^^{A) = '^A^fj, f^-{B) = '^fjB^ if B (2.12) 

j=0 j=0 

An element a G HI is called a left (right) zero of / if f^^{a) = 0 (respectively, 
= 0). For polynomials with real coefficients, left and right evaluations (and 
therefore, the notions of left and right zeros) coincide. The characteristic polynomial 
of a non-real conjugacy class [a] C H is defined by 

~ <a)(^ — a) = z‘^ — z{a -|- a) -|- \a\^] (2-13) 

it follows from characterization (j2.1ll that formula (j2.13l) does not depend on the 
choice of a G [a]. Since Tjq,] is the polynomial of the minimally possible degree such 
that its zero set (left and right, as A G M[z]) coincides with [a], it is also called the 
minimal polynomial of [a]. Observe that the matrix X\^a]{B) is invertible if and only 
if [a] n <Tr{B) = 0. 

Since the division algorithm holds in ]H[z] on either side (see e.g., [IH]), any (left or 
right) ideal in EI[ 2 ;] is principal. We will use notation (/i)r and {h)i for respectively 
the right and the left ideal generated by h. An ideal is maximal if and only if it is 
generated by the polynomial 

p^{z) = z — a, a G HI (2-14) 

and it follows from respective (left and right) division algorithms that 

/ G (pjr r^(«) =0, / G (p„), r{a) = 0. (2.15) 

A left (right) ideal is called irreducible if it is not contained properly in two distinct 
left (right) ideals which occurs if and only if it is generated by a polynomial p of the 
form p = ■ ■ ■ Pan some spherical chain a = (oi,..., a„) (see e.g., [2]). 
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2.4. Sylvester equations and interpolation by polynomials. The equation 
m arises naturally in the context of the following interpolation problem; given 
polynomials p, p, g, g, find a polynomial f € IHI[ 2 ;] such that 

f - 9 ^ {p)r and f -g G {p)i. (2.16) 

If p = and q = then due to equivalences (I2.15|] . conditions (I2.16|] can be 
written in terms of left and right evaluations as 

rna)=7 and r^{fi)=6 (2.17) 

where 7 = h^^{a) and 6 = g^'^(fi). The following result appears in [T]. 

Theorem 2.2. There is a polynomial f G ]HI[ 2 ;] satisfying conditions (I2.17P if and 
only if the Sylvester equation 

ax — xfi ='y — 6 (2-18) 

has a solution. If this is the case, all polynomials f G Bipz] satisfying conditions 
(j2.17l] are parametrized by the formula 

/ = 7 + PaX + Pahpfj (= <5 + Xpfi + Pahpfj) (2.19) 

when X runs through the solution set of (I2.18jl and h varies in EI[z]. 

Note that letting /i = 0 in (I2.19P gives all linear solutions to the problem (j2.17l) 

while letting 7 = (5 = 0 leads us to the explicit description of the intersection 

{P/ 3 )i n {pa)r of two maximal ideals in ]H[z]. 

Let us now assume that the ideals {p)r and (g)^ in (12.161) are irreducible, i.e., that 
p and q are of the form 

P = PaiPa 2 ---Pa„, 9 = P/3^ P/3_ i ' ' ' P/3i ( 2 - 20 ) 

for some spherical chains where a = (oi,..., On) and (3 = (fii,..., fim)- Let us 
introduce the matrices 



1_ 

0 

02 

- O 

1 

O 


r fii 

1 

0 ... 

fi2 0 

1- 

o 

II 

0 

1 



II 

0 

1 



1 

o • • • 


0 

1 

• 1—1 


1 

o • • - 

... 0 

0 

1 fim . 


and let En G be the column with the top entry equal one and all other entries 

equal zero. 

Theorem 2.3. Let p and q be as in (12.2011 and, given polynomials g{z) = 
and g{z) = 'xlth degp < degp and degg < degp, let 

^ = E - Y, EnOkEljJ. (2.22) 

j k 

Then there exists a polynomial f G IHipz] satisfying conditions (|2.16p if and only if 
the Sylvester equation 


J^X - XjJ = c 


( 2 . 23 ) 
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has a solution X = [xij] € If this is the case, all polynomials f G ]HI[z] 

satisfying conditions (j2.16|) are parametrized by the formula 

m 

f = g + '^P ■ Xn,l ■ p/3, p/3, _1 ■■■ppi+p-h-p (2.24) 

when \xn,i ■ ■ ■ iCn,m] is the bottom row of an X satisfying (I2.23P and h varies in 
IHipz]. The polynomial f in (I2.24p can be written in the form 

n 

f = 9 + '^ PaiPa2--- Pak ■ Xk,m ' P + P ' h ■ p. (2.25) 

k=l 

The latter theorem is an extension of Theorem EH Formulas (|2.19p suggest 
to conjecture that in more general setting of Theorem 12.31 all solutions / to the 
interpolation problem should be paprametrized by the “whole” solutions X to the 
equation (|2.23p rather than by their bottom rows (in formula (I2.24p l or the rightmost 
columns (in formula (j2.25p ). However, there is no contradiction here: as we will see 
in Section 4, any solution X to the equation (j2.23p is completely determined by its 
bottom row or its rightmost column. 

Letting /i = 0 in (|2.24l) (or in (j2.251) 1 gives all solutions / to the problem (j2.16p 
with deg / < degp + degp while letting g = g = 0 leads to fairly explicit description 
of the intersection {p)i n {p)r of two irreducible ideals in ]Hl[z]. The case of general 
polynomials p and p in (j2.16p can be reduced to the irreducible case upon making 
use of the Primary Ideal Decomposition Theorem and gives rise to the Sylvester 
equation (II.8p with A and B in the block diagonal form with all diagonal blocks of 
the from (12.211) . The formulation of Theorem 12.31 is presented here to explain our 
interest in the Sylvester equation with A and B of the form (12.211) . The proof of 
the theorem along with reformulations of conditions (12.161) in terms of evaluation 
functionals is given in [3]. 

3. Quaternion Sylvester equation: the regular case 

Applying the map ()2.5p to the equation ()1.8I) we observe that for any solution 
X G q£ (j£_gp _ matrix Y = ip{X) G solves the complex Sylvester 

equation 

if{A)Y-Yg,{B) = g,{C). (3.1) 

By the Sylvester’s criterion and due to equivalence (12.7p . the latter equation has 
a unique solution if and only if ar{A) n crr{B) = 0. If this is the case, we apply 
the map (j2.8p to the equation (j3.ip . Due to properties (12.9p . (|2.10l) of V' we get 
A'tpifY) — ijj{Y)B = C so that X = V’(L") is a solution to (jl.8p . It is a unique solution 
since (f is injective, so that distinct solutions to (jl.8p would have given rise to distinct 
solutions of (j3.ip. We thus arrive at the following result (see e.g., my- 

Theorem 3.1. Equation (II.8h has a unique solution (for every C G if and 

only if ar{A) ncJr(B) = 0. 
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As we have seen, the unique solution of the equation (jl.8|) is necessarily of the 
form X = 'ijj{Y) where Y is the unique solution to the complex Sylvester equation 
m- Taking any formula for Y available in literature, one gets a formula for X by 
letting X = 'ipiY). For example, if we denote by /r : the bijection 

assigning to the matrix X = [xij] the the column x as in (jl.2p . the formula for 
the unique solution X of the equation (jl.Sp suggested by the original Sylvester’s 
approach m is 

X = 0 hm - hn ® <^(S))“V(</'(C'))))- (3-2) 

The formula with minimal references to complex representations (extending the 
Jameson’s result [10] i has been established in [3116]: 

( 2n k-1 \ / 2n \ 

EE akA^CB^-^- ‘ E“i^' • (3-3) 

k=lj=0 ) \i=o / 

where qq, , a 2 n € C are the coefficients of the polynomial 
det(A/ 2 n — =0.0 + Ol-^ + . . . + 


In the rest of the section we examine how far one can advance making no use of 
complex representations of quaternion matrices. We start with a very special case 
where A and B are Jordan blocks and establish the quaternion analog of the formula 
m- On the other hand, this result generalizes Theorem ll.il 


Theorem 3.2. Let A and B be Jordan blocks A = J^n{o() and B = for some 

a 9 ^ /3. Then the equation d+HD has a unique solution 


n+m—2 /k+1 

E E(-') 

k=0 \i=0 

where Pa ,/3 is given by dlSI) and 


k+i I ^ ^ \ —k—i+1 ji/T ai \ u—k—1 




(3.4) 


Mk = ^{-lY(''^Ft^C{Fj,Y for k = 0,...,n + m-2. 
i=o ^ ^ 


Proof. Observe that Pa^g = T’[q,](/ 3) (the value of the characteristic polynomial (12.1311 
at /3) and thus, P^^p 7 ^ 0 if and only if a 9 ^ /3. As A = aln + Fn and B = (din + 

(see (lEl), the equation (|1.8D takes the form 

aX -X/3 = C- FnX + XFY- (3.5) 

We now subtract the latter equation multiplied by (3 on the right from the same 
equation multiplied by a on the left: 

|apX - (a + a)Xl3 + X/3^ = a{C - FnX + XF^) - (C - FnX + XPj()/3. 

Since |ap and (a + a) are real and therefore commute with all quaternions, the 
latter equality can be written as 

XP^^p = aiC - FnX + XFY) -{C- FnX + XPjj/?, 


(3.6) 
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in view of (ll.9p . Thus, equation (|3.6jl follows from (I, '1.51) . On the other hand, we 
may subtract the equation (j.S.hp multiplied by [3 on the right from the same equation 
multiplied by a on the left; 

= {C- FnX + XFl)P^^p. 

Since Pa,(3 7^ 0 commutes with /?, we may cancel Pa,p in the latter equation arriving 
at (|3.5jl . Therefore, equations (j3.5jl and are equivalent. Since Pa,13(3 = l3Pa,i3 
and since the matrices Fn and Fm are real, we can iterate the equality (13.61) as 
follows; 

=iac - C/3)P^^J - + aXP^JPl 

+ FnXP(J/3-XP(J^FZ(3 
={aC - CI3)P:(,^ - a\FnC - CF^)P:J 
+ 2a{FnC - CFl)l3P(J - {F^C - CP^)/32p-/ 

+ Fn{oF{FnX -XFl)- a{FnX - XFl)(i)P(J 

- ia\FnX - XFl) - a{FnX - XpT)/3)P„7/pT 

- Fn{a{FnX - XFl)l3 - (P„X - XF^)f3^)P^^J 
+ {aiFnX - XF;^)/3 - {FnX - XFj,)f3^)P^^jFj^. 

Continuing this iteration and letting r = m + n—1, we get after r — 1 steps 

^~2 /fc +1 y, \ 

XP^+n-i = ^(_l)fc+0 + p^+n-k -2 ^ (3 7 ^ 

fc=o Vi=o V * / y ’ 

where R contains the terms containing factors F^ and {F^y of total degree £ + j> 
m + n — 1. Thus, either £> n in which case F^ = 0, or j > m in which case 
{F^y = 0. Hence all terms in R are zero matrices (i.e., P = 0), and formula (13.41) 
follows immediately from (13.7|) . Since (|3.6|) is equivalent to (|3.5I) . each iteration of 
(j3.6D (and in particular, the formula (13.41) 1 is equivalent to (13.5D . Therefore, X of 
the form (13.41) is a solution to (j3.5p . The uniqueness of a solution is evident. □ 

In Theorems 13.31 13.41 and 13.81 all restrictions on B will be removed. The next 
theorem settles the case where A = Ja is of the form (12.211) with all diagonal entries 
be in the same conjugacy class. To formulate the theorem, let us note that for the 
matrices 



■ «! 

1 

0 

0:2 

0 

0 ■ 


Ol 

-1 

0 

02 

0 

0 ■ 

A = 

0 

1 



, A' = 

0 

-1 




0 


0 

0 

1 Oin - 


0 


0 

0 

1 _ 


based on ai,..., a„, from the same conjugacy class, we have 

A + A' = {ai+ ai)In and A'A = \ai\'^ R — A, 


(3.9) 
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where A = [(a^ - ai+i)dij+i + 5jj+2]"j=i; more explicitly: 


0 0 

Ol\ — 0.2 0 


A = 


1 02 — 03 

0 1 


0 1 


: 0 0 
0 ... 01 On-l — On 0 _ 

Relations p.9ll follow from characterization (12.ip . due to which \ai 
Reoj = Real for z = 2 ,..., n. 


(3.10) 


|aip and 


Theorem 3.3. Let ai, ..., a^ he the elements from the same conjugacy class R C H, 
let the matrices A, A' and A be defined as in (13.8p . (I3.10p . and let B G 
such that V (1 Or{B) = ih. Then the equation (II. 8 h has a unique solution 

n—1 

X = '^1'^ {A'C-CB) {Av{B))-’^-\ (3.11) 

k=0 

where Xy is the characteristic polynomial of the conjugacy class V. 

Proof. We multiply equation (II. 8 p by A' on the left and by B on the right 
A'AX - A'XB = A'C, AXB - XB'^ = CB, 
and then subtract the second equality from the first. Due to (|3.9I) . we get 
|aipX -AX - (ai + ai)XB + XB^ = A'C - CB, 
which can be written in terms of the characteristic polynomial (| 2 . 2 I) as 

XXv{B) = A'C-CB + AX. (3.12) 

We now iterate (I3.12p as in the proof of Theorem 13.21 

X{Xv{B))^ ={A'C - CB){Xv{B)Y-^ + AX{Xv{B)Y-^ 

= {A'C - CB){Xv{B))'^-^ + A{A'C - CB){XviB))^-^ 

+ A‘^X{XviB))'^-^ = ... 

n—1 

= '^A^ {A'C - CB) (Ty(R))"-^-^ + A^X. 

k=0 

The last equality implies (13.111) since A^ = 0 (see (13.101) 1 and Xy{B) is invertible, 
since V D ar{B) = 0. □ 

We next remove the assumption that all diagonal entries in A are in the same 
conjugacy class. In this case, we get explicit formulas for the rows of a unique 
solution X. In what follows, we will use the noncommutative product notation 

rv 

k k 

Wli ■■= lil 2 ■■■ Ik-, and 7i := 7fc • • • 7271 
i=l i=l 


(3.13) 
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Theorem 3.4. Let A = Ja G he defined as in (j2.21jl . let B G he such 

that afiB) n [a^] = 0 for k = 1,... ,n, and let Cj denote the j-th row of the matrix 
C G Then the equation (II. 8 h has a unique solution 

X = Coli<fc<„Xfc, (3.14) 

with the rows given hy 

r\ 

- E ^ (B) ■ (3.15) 

j=i i=j 

for k = 1,... ,n, where the polynomials p-^. and T[q,.] are defined via formulas (I2.14p 
and (I2.13p . respeetively. 

The proof is based on the following two observations. 

Remark 3.5. Let f and g he two quaternion matrix polynomials and let B he a 
square matrix. Then 

{fgr{B) = {f-g^^{B)r{B). (3.16) 


Indeed, since evaluations (|2.12p are linear, we have 

{f-g--{B)r {B) = {f-Y,9jB^r{B) 

j 

= Y.fk{Y.gjB^)B>^ 

k j 

= E( E fkgj)B. = {fgr{B). 

i j^k=i 

Remark 3.6. Let he given hy (|2.14l) for a fixed a G HI and let 

{p^Or {B) = M (3.17) 

for some D,M ^ and B G such that (Jr{B) n [a] = 0. Then 

D = (p^UnB) ■ {A:[^]{B))-\ (3.18) 


Indeed, applying (13.1611 to f = p^ and g = p^D and taking into account equalities 
Pa Pa = ‘^[a] and (I3.17p . we get 

= {p^-{p^DnB)r^ (b) = (p^nr (b). 

Since 7f[Q,] is a polynomial with real coefficients, we have 

iX[^^Dr-iB) = (DX^^^riB) = Z1T[„](S), 


and (j3.18p follows from the two latter equalities, once we recall that the matrix 
X[ai]{B) is invertible (as [a] = 0). 



















12 


V. BOLOTNIKOV 


Proof of Theorem \3.4\ Equating the corresponding rows in the equation (II.8|) we 
get, due to (12.211) and (I.I.Mj) . the system 

aiXi — XiB = Cl, akXk — X^B = Ck — Xk-i (/c = 2,..., n), 

which is equivalent to (jl.8jl . The latter equalities can be written in terms of right 
evaluations as follows; 

(p„^Xi)"’'(S) = -Ci, [p^^Xky\B) = Xk-i-Ck ik = 2,...,n). (3.19) 

Making use of Remark 13.61 we solve the leftmost equation in (I3.19h : 

(R) • 

confirming formula (13.151) for Xi. Similarly, we solve the A:-th equation in ()2.5p for 
Xk-. 

Xk = {p^,iXk-i - Ck)y^ {B) • (T[„^](R))-1. (3.20) 

Assuming that the formula (13.151) holds for Xk-i, we plug it into (|3.20l) : 

Xfc = -(p^^Cfc)"’’(i?)-(A'[„^](R))-i 
^ fc-i 

“ {Pak-l " ' (B) ■ 

\ 

= - {paAy^ (B) ■ 

r\ 
k 

- E ■ ■ ■ lk!,Ci) ' (-B) ■ n(-rM{B))"' 

j = l i=j 

k ^ 

= - E (fe. ■ ■ ■ feVj)'' (B) ■ II(^K1(B))-‘ 

j=l i=j 

and the induction argument completes the proof of formulas ()3.15l) . Note that the 
second equality in the last calculation followed by Remark (3. 5 1 applied to polynomials 
/ = Pat 9 ~ Pak-i Pa s-iid since the characteristic polynomial A[q,.] is in 
]R[z] for i = 1 ,..., n. □ 

Remark 3.7. If the matrix A = Ja is based on the elements ai,...an from the 
same conjugacy class V, then formulas (|3.15l) simplify to 

k 

= - E {p^kp^k-^ • • • p^AT 

i=i 

Making use of Remark 13.61 one can get the formula for the unique solution of the 
equation (II.8j) in the case where A is lower triangular. 


rv 

k-1 


n(-^M(^))'' (B) 


i=j 
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Theorem 3.8. Let A = ® lower triangular matrix (uij = 0 for i < j) 

and let B E such that (7r{B) n [afc,fc] = 0 for k = Then the 

equation has a unique solution X of the form (I3.14p with the rows defined 

recursively by 

(^) • 

^k = - (B) • (3.21) 

fe-i 

+ (^) • k = 2,...,n. 

i=i 

Proof. Equating the corresponding rows in the equation (ll.8h we get, due to the 
lower triangular structure of A, the equations which can be written in terms of right 
evaluations as 

fc —1 

(B) = -Cl, [p^.X^y^ (B) = -Ck + 

i=i 

for k = 2,... ,n. The rest follows by Remark 13.61 □ 

To keep the presentation symmetric we conclude with the “column” version of 
the last theorem. 

Theorem 3.9. Let B = be an upper triangular matrix ((3ij = 0 for i > j) 

and let A E be such that cJr(^) n [Pk^k] = 0 for k = 1,... ,m and let C = 

[Ci ... Cm] ■ Then the equation (|1.8I) has a unique solution X = [Xi ... Xm] 
with the columns defined recursively by 

X, ={A:^f,^ JA))-^ ■ (c,f>p^ y\A), 

Xk • {Ck\,r (^) (3-22) 

k-l 

+ for k = 2,...,m. 

i=i 

Proof. Let us observe that the left and right evaluations (I2.12p are related as follows: 
for any a E H, O E and B E 

{B) = CB-aC = {B*C* - C*a)* = [{C*p^f^ {B*)]* . (3.23) 

Taking adjoints (|1.8I) we get the equation 

B*X* - XM* = -C*, (3.24) 

and since the matrix B* is lower triangular, we can apply Theorem 13.81 (with A, B, 
C replaced by B*, A* and —C*, respectively) to get recursive formulas for the rows 
of the matrix X*. Taking adjoints in these formulas and making use of relations 
(j3.23l) . we get ()3.22p . □ 
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To conclude, we remark that in case B = where is of the form (12.211) . the 
recursion (j3.22p can be solved to produce formulas 


w=En(%iO))^‘ 

j=li=j 



(1 < 1 < m). 


4. Singular case: the solvability criterion 


In this section, we consider the Sylvester equation (jl.Sp in the case where crj.{A) D 
(Tr{B) 7 ^ 0. At certain level, this case also can be handled by making use of complex 
representation of quaternion matrices: one can pass from (jl.Sp to the equivalent 
complex Sylvester equation (|3.ip . then use any avaliable method (e.g., Sylvester’s 
tensor-product approach recalled at the beginning of the paper) to describe all its 
solutions, and then claim that the formula X = ipiY) describes all solutions to the 
Sylvester equation (11.81) when Y runs through the set of all solutions to the equation 
m- This approach is not quite satisfactory, partly because the map i/) (123]) is not 
injective. Presumably, some quantative results still can be obtained on this way: to 
define the number of independent conditions (scalar equalities) which are necessary 
and sufficient for the equation (|1.8p to have a solution along with the number of 
independent free parameters needed to parametrize the solution set. In this section 
we will obtain more definitive results of this sort (explicit solvability conditions and 
parametrization of all solutions) in the case where the matrices 


^ ^OL — T 1 


B = JJ 




m 

ij=l 


(4.1) 


are based on spherical chains a = (ai ,..., an) and /3 = (/3i,..., /3m)- We assume 
without loss of generality that n > m and start with the scalar equation 


ax — xl3 = c, where a f3. (4.2) 

Let us recal that a unit element / G BI is purely imaginary if and only if = — 1. 
Therefore, the characterization m can be reformulated as follows: a ^ 13 if and 
only if a and f3 can be written as 

a = X + yl, (3 = X + yl (x € M, y > 0, /^ = /^ = —1). (4-3) 

Since HI is a (four-dimensional) vector space over M, we may define orthogonal com¬ 
plements with respect to the usual euclidean metric in M^. For a and /3 as in (|4.3I) . 
we define the plane (the two-dimensional subspace of HI = M^) via the formula 

{ span{l,I} = {u + vl : n, u € M}, if /3 = a, 

{span{l,I})^, ^ if /3 = a, (4.4) 

span{I + I, l — II}, if (3 ^ a,a. 

Since a = x — yl, it follows that = Ila^Q, = 11^,0 and 

1 ^ 0,/3 = span{I — 1,1 + 11} if (3 ^ a, a. (4-5) 

Lemma 4.1 ([I]). Given a /3, the solution set of the homogeneous Sylvester equa¬ 
tion ax = xj3 coincides with 11^^^ given in (14.4h . Furthermore, the non-homogeneous 
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equation ra has a solution if and only if ac = eft (equivalently, c € which 

ease the solution set is the affine plane 

{2lm[a))~^c + Iia,i3 = -c(2Im(/3))“^ + 11^,^. 


We now proceed to the matrix equation 

AX-XB = C, X = [xi,] = [Xi ... Xm] , (4.6) 

where A and B of the form (j4.ip are based on the spherical chains a and (3 from 
the same conjugacy class V and where we assume without loss of generality that 
m < n. With the given matrices A, B and C, we associate the matrix 

D :=CB- A'C (4.7) 

where A' is given in (j3.8ll . In more detail, if C = ^i,o — 

coj = 0 for all i,j, then 

^ ~ ~ ~ + Q-lj- (4-8) 

We next introduce the elements € HI by the following double recursion 

^k,j = (afc — a/e+i) ^ [dk+i,j + rfc+ij _2 — — Tk+ij-iiftj — /3j-i)] (4.9) 

with the initial conditions 

r_ij = Toj = = 0 for all k,j>l. (4.10) 

It is clear from (14.9p that the assumption that ci is a spheerical chain (i.e., that 
Oik 7 ^ oik+i) is essential. We make several further comments. 

Remark 4.2. (1) Recursion (|4.9I) determines for all positive k < n and j < m 
such that k + j < n. 

( 2 ) Any element from the hrst “column” is determined by the elements 
(1 < r < k). 

(3) Any element in the Ath counter-diagonal Vi = {r^j : A; -|- j = .^ -|- 1} can be 

expressed in terms of the elements from the previous counter-diagonal : 

k + j = (f\ and one fixed element in Bi. The latter follows from the formula (j4.9p 
since fij ^ ftj {(3 is a spherical chain). 

Lemma 4.3. Let F^ j be defined as in (|4.8p - ()4.10p and let 

'S'i = +rij-i(^i-i-^j) + rij -2 (j = l,...,m). (4.11) 

If Sj = 0 for j = 1,... ,m, then 

^k,j • oy.jAi'kj ^k,jPj Okj T F^_]^j 0 (4.12) 

for k < n and j < m such that k + j < n. 

Proof. We first observe that for a ^ ft, 

a(a — ft)~^ = {a — ft)~^ft, a — ft = ft — a. 


(4.13) 
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By M^ dTOD . 

r*;,! = (oik — — rfc-i,i] 

= (afc — [cfc+i,i/3i — ak+iCk+i,i + Cfc,i — rfc-i,i] • (4-14) 

Making use of the first equality (with a = ak and /? = afc+i), we have, on account 
of (j4.7l] and (j4.14p . 

=Oik^k,i — rfc,i/3i — Cfc^i + rfc_i^i 

={ak — afc+i)“^ [oik+i (ca;+i,i/3i — a^+iCfc+i,! + Ck,i — Bfc-iq) 

“ (cfc+i,i/3i — afc+iCfc+i^i + Cfc^i — rfc_i^i) /3i] — Cfc^i + rfc_i^i 

={otk — afc+i) ^ [—Cfc+id:’[Q,j.^j(/3i) +afcCfc^ — 

—afcBA:-!,! + Vk-l,lfil] 

=^k+i — a.k)~^ [dk,i — Ck-i,i + akTk-1,1 — Tk-i,jl3i ], (4-15) 

where the last equality follows since jdj ~ otk+i so that d:^[afc+i](/5i) = 0) due to 
formula (I4.8j) for dk,i- Letting A: = 1 in (j4.15|i and taking (j4.6[) into account, we get 

Ai,i = (a2 - = (02 - (4.16) 

For /c > 1, we have 

afeLfc-ij — rfc_ij/3j = Afc_i^i + (ofc — afc_i)rfc_i^i + Ck-i,i — rfc_2q 

= Afc-iq + Cfc_iq — dfc,! 

where the first equality follows from formula (j4.12ji for Afc_i^i, and the second equal¬ 
ity follows from (j4.14p (with k replaced by A: — 1). Combining the latter equality 
with ()4.15p gives 

Afc,i = (afc+i — aA:)~^AA;_i,i, 

from which, on account of (I4.16jl . we recursively obtain 

Afc,i = (ofc+i — afc)~^(afc — Oik-i)~^ ■ ■ ■ (02 — ai)~^'S'i. (4-17) 

Since Si = 0, it follows that 

Afc,i := akTk,i — Tk^iPi — Ck,i + Ffc-i^i = 0 (A: = 1,..., n — 1). (4-18) 

We now assume that j > 2. Making use of the first equality (with a = at and 
(3 = ak+i), we have, on account of (14.9p . 

afcFfcj — (4-19) 

= OLk{ak — a/c+i) ^ [dk+i,j + rfc+ij_2 — Lfc-ij — Tk+ij_i{j3j — /3j-i)] 

(cifc Olk-\-l) -|- Y'k-\-lJ—2 ^k—l,j ^k+l,j — lil3j /^i — 1 )] fdj 

= (ak — ak+i)~^ [ak+idk+ij — dk+ijPj + ak+i^k+i,j-2 — ^k+i,j-2l3j 

c^k+i^k—i,j 4“ ^k—i,j/3j (cifc+iFfc-i-ij_i T'f^^i j_i/3j—i)(^/3j . 
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Observe that in view of (j4.8p and (j2.13j) . 

o^k+idk-\-i,j f3j —cifc-i-i(cfc-|_ij’ -|- + cj^ j) 

(cfc+ij/5j cifc-i-iCfc-i-ij’ -|- -|- Cfcj)/3j 

=Ck+l,j^[ak+-L]ildj) + «fc+l(Cfc+lj-l + Cfcj) 

~ (cfc+ij—1 + Ckj)j3j 
—cifc+i (cfc+ij—1 “b Cfcj) (cfc+ij—1 ~\~ 

where the last equality follows since fij ~ afe+i so that (/3j) = 0. We next 

observe equalities 

Clfc+lCfc+l j —1 C.k+l,j—lld j — \ — Cfc+lj —l/ 5 j —1 C^k+l^k+lJ — l 

— C^fc+lj —1 Cfc+lj—2 CkJ—1 
ClfcCfcj' ^k,jPj — -|- Ck—ij dk,ji 

which follow from (I4.8jl and the fact that the elements from the same conjugacy 
class have the same real part. Combining the three last equalities gives 

(^k+ldk+l,j dj«-\-ijPj — dk-\-lJ — l dk,j Cfc+lj —2 4“ Ck—lJ 

+ (afc “ Oik+i)ckj + Cfc+ij_i(/3j — Pj-i). 

We now substitute the latter equality into (I4.19[) and then (I4.19p into the definition 
(j4.12p of Akj to conclude 

^k,j —ckfeTfej Cfcj’ -|- 

— {p^k ®fc+l) — 1 dk,j Ck+l,j—2 4“ Cfc—Ij 

4- (ofe ~ (^k+i)ck,j + Cfc+ij-i(/3j — Pj-i) 4- ak+i^k+i,j-2 

(®fc+irfc-)-i j— 1 
^k,j 4“ j 

(cifc ®fc+l) —1 dk,j Cfc-|_ij_2 4“ Ck—lJ 

— ak^k-l,j + ^k-l,jPj + aA:+irfc+ij_2 — Tk+lJ-2l3j (4.20) 

(cifc+irfc-)-i j—1 Y 

Letting j = 2 in the latter equality gives 

^k,2 =ioik — Ofc+l) ^ — dk,2 + Cfc-1,2 — afcrfc-1,2 4- Yk-l,2ld2 

— (afc+iLfc+i,! — Lfc+i^i/Si — Cfc+iq)(/?2 — /3i)] — r^^i. (4.21) 

Taking into account the first equality in ()4.14p and equality 

ttfe+iLfc+i,! — Tfc+i^i/Si — Cfc+iq = —Tfc^i 

which is a consequence of (I4.18[) (with k + 1 instead of k), we simplify (j4.2ip as 
follows: 


Ak,2 ={oik+i — Oik) ^ [dk,2 — Cfc-1,2 4- akYk-1,2 — Yk-i,2P2 
+rM(/3i-^2)-rfc-i,i]. 


(4.22) 
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Letting fc = 1 in (j4.22p gives, on account of (I4.11j> . 

^1,2 = (a2 — «i) ^ [c?i,2 + —/32)] = (a2 — «i) ^‘S'2. (4.23) 

On the other hand, if A; > 2, then 


(afc-l — «fc)rA:-l,2 = dk^2 — rfc_2,2 + rfc4(/3l — ,^ 2 )) 
by formula (14.9p (for k replaced by A: — 1) and therefore, by the definition (14.121) for 
^fc- 1 , 2 , 

Aa:-1,2 ={ak-l — ak)Tk-l,2 + afcrfc_l,2 — ^k-l,2P2 — Cfc-1,2 
+ rfc- 2,2 — 

=dk,2 + rfc4(/?i — /32) + afcrA;-l,2 — ^k-l,2P2 “ CA:-1,2 “ 
which, being substituted into (j4.22D . leads us to 

Afc ,2 = (ofc+i - ak)~^Ak-i ,2 for A: = 2,..., n - 2. (4.24) 

We now recursively obtain from (14.241) and (|4.23l) that 

^k,2 = (ttfc+i — afc)~^(afc — afc-i)~^ • • • (^2 — ai)~^'S' 2 , 
and since S 2 = 0, it follows that 2 = 0 for A: = 1,..., n — 2. 

The rest will be verified by induction in j. Let us assume that 

Afc/ = 0 for all A; < n and i < j such that k + £ < n. (4-25) 

In particular. 


so that 


Afc+ij —1 — o:k-\-i^k+i,j—i ^k+i,j—i/dj—i 

('k+lj—l T Yk-\-lj—2 0, 


Oik+iYk+i,j—i Yk-\-ij—i/3j—i — Yk-\-ij—2 L/j j_i. (4.26) 

Substituting the latter equality into (14.201) gives 

^k,j (*Tfc *TA:+i) [^fc+l,j —1 dk,j Ck-\-lJ—2 T ^k—l,j 

— afcLfc-ij + Tk-ijf3j + a!A:+irfc+ij_2 — Tk+ij-2Pj 

— (rfc+ij-2 — Yk,j-i){(dj — (dj-i)] — Lfcj-i 

(*Tfc <Tfc+l) [dk+lj—l dk,j Ck-\-lJ—2 T Ck—lJ 

— (^kYk-ij + Tk-ijPj + ak+iYk+ij-2 — ^k+i,j-2(dj-i 

+^k,j-iWj — Pj-i) — (ak — afc+i)rfcj_i] . (4.27) 

By the definition ()4.12p of Ak+ij- 2 , 

Otk+l^k+l,j-2 — ^k+l,j-2ldj-i =Afc+lj_2 + Cfc+ij_2 — Yk^j-2 

+ rfc+l,i-3 + ^k+l,j-2{(dj-2 — 
and since, by formula (j4.9h (with j — 1 instead of j), 

rfc+i,j-2(/3j-2 — fdj-\) ={oik — afc+i)rfcj-i — dfc+i,j-i 

“ rfc+i,j-3 + 
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combining the two latter equalities with the assumption (14.251) gives 

ak+l^k+l,j-2 — rfc+lJ-2/^j-l + dk+l,j-l — Cfc+lj-2 “ (cifc “ ak+l)TkJ-l 
= Tfe-ij-i — ^k,j-2- (4.28) 

Substituting the latter equality into (|4.27p gives 
^k,j (®fc+l ®fc) [dkj ^k—l,j 4“ 

+rfcj-i(/3j-i — Pj) — Ffc-ij-i + Tkj-2\ ■ (4.29) 

Letting fc = 1 in (I4.29P we get, on account of (14.lip . 

=(«2 — ai)~^ [dij + rij_i(/3j_i — j3j) + rij_2] 

=i^2 — Sj. (4.30) 

If A: > 2, then by the definition (14.121) of ^k-i,j and by formula p4.9p (with k — \ 
instead of /c), we have 

^k—l,j (cifc—1 Olk)^k—l,j 4“ Qffchfc—ij' T'ig_i jf3j 

Ck—l,j 4“ ^k—2,j 

dk,j 4“ r^j_2 T'j f5j—\) 4“ j T'j«_i jl3j 

Ck—i,j L/j—ij—i (4.31) 

which together with (14.301) implies 

= (ofc+i - afc)"^Afc_ij for k = 2,...,n-j. (4.32) 

We now recursively obtain from (I4.32p and (I4.30p that 

Afcj = (afc+i — afc) ^(ofc — afc-i) ^•••(02 —ai) ^Sj, 

and since Sj = 0, it follows that = 0 for k = l,...,n — j. The induction 

argument completes the proof of the lemma. □ 

Remark 4.4. The elements defined as in (|4.12p satisfy relations 

^k,j = (ofc+i — ak)~^ [Ak+ij-i(/3j — fij-i) — Ak+ij-2 + Ak-ij] (4.33) 

for k >2 and j > 3. 

Proof. Although we do not assume equalities (I4.11h to be in force, formula (I4.20p 
still holds true. Without assumptions (|4.25l) . equality (14.2611 gets the extra term 
Ak+ij-i on the right, so that formula (I4.27h takes the form 

^k,j — (dfc ®fc+l) [dk+l,j — l dk,j Cfc-|_ij_2 4“ Cfc_ij 

— ^k^k-l,j 4- Tk-ljfij + ak+lTk+l,j-2 — ^k+l,j-2fij-l 

— (Afc+ij -1 — — fij-i) — {oik — afc+i)rfcj_i] . (4.34) 

For j > 3, we use equality (14.2611 with the extra term Ak+i,j -2 on the right; substi¬ 
tuting this modified equality into (I4.34p gives the following modification of (14.291) : 

^k,j —(®fc+l Clfc) [dk,j Ck—lJ 4“ CXkTk—lJ ^k—l,jfij ^k—l,j — l 

4-(Afc+ij_i — TkJ-l){/3j — fij-l) + ^k,j-2 — Afc+ij_2] . 

Combining the latter equality with (14.311) (in case k >2) gives (|4.33p . 


(4.35) 

□ 
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Since Fi^i = (ai — 02 ) ^^ 2 , 1 ; we derive from (j4.34p 

^ 1,2 = (^2 — ai) ^ [di^2 + (‘^ 2,1 — ri,i)(/32 — A)] , 

which, on account of formulas (14.111) and (|4.17p for S 2 and A 2 ,i, respectively, can 
be written as 

Ai,2 = (02 - ai)“^5'2 + {02 - ai)“^(a3 - a2)“H«2 - ai)~^Si{l32 - A)- 

As expected, the letter formula coincides with (14.231) in case Si = 0. For j > 3, we 
have from (j4.35p and formula (|4.1ip for Sj, 

Aij ={a2 - ai)“^ [dij + (A 2 J -1 - Tij-i){j3j - (3j-i) + rij_2 - A 2 J- 2 ] 

=(a2 — oil) ^ [5'j + A2j-i(/3j — /3j-i) — A 2 J- 2 ] , (4.36) 

which is the analog of (l4.3Up . It is now clear from (|4.33l) and (j4.36p that A^ j is a 
two-sided linear combination of 5i,..., with left and right coefficients depending 
respectively, on a and /3 only. However, explicit formulas for A^^- in terms of 
Si,... ,Sk are quite complicated. 

Theorem 4.5. Given matrices A, B (based on spherical chains a = {ai,... ,an) 
and (3 = (/3i,... ,/3m) ('m < n)from the same conjugacy class V (ZM.) and C as in 
(14.6p . let dij and Fjj be defined as in (I4.7p - (l4.in|) for k < n and j < m such that 
k + j < n. Then 

(1) The Sylvester equation (14.6p has a solution if and only if 

-/?j) + rij _2 = 0 for j = l,...,m. (4.37) 

(2) For any solution X = [xij] € to the equation (14.61) . Xij = Fjj for all 

i < n and j < m such that i + j < n 


Proof. Let X G gg^^igfy (| 4 , 6 p , yve now verify that 


AX -XB = D (4.38) 

where A', A and D are defined in (13.81) . (j3.10l) and (14.71) . respectively, and where 
' 0 /3i-fi2 1 0 ... 0 

0 0 /32-fi3 : 



0 

1 


(4.39) 


0 0 


0 fim—l firri 

0 0 


We start as in the proof of Theorem 13.31 multiplying the equation (|4.6I) by A' on 
the left and by B on the right and then subtracting the second equality from the 
first we get (|3.12l) . For B of the form (14.11) . 

XviB) = B^- 2Refii ■ B + |/3i|2/^ 

= [Xvifii)6ij + ifii — /3j_|_i)(5i+ij -I- Si+2,j]ij^i: 

and since fii G H for i = 1,..., n, we conclude that Xv{B) = B] see (I4.39h . Now 
(j3.12l) takes the form (14.381) . 
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Let Xj and Dj denote the j-th column in X and D, respectively: 

Xj Col]^</;;<^X/^ j , Dj C0l]^</;;<^d/^ j . 

Taking into account the explicit structure (j4.39l) of B, we now equate the corre¬ 
sponding columns in (|4.38p : 


AXi = Di, (4.40) 

AX2 = D2 + X^{(3i-P^), (4.41) 

AXj = Dj + X,_i(/3,_i - 'Pj) + Xj _2 (3 < j < m). (4.42) 


Making use of explicit formula (j3.10p for A we equate the top entries in (14. 40 1) and 


where 


0 (that is, the first condition 

in (IT^L 

(|4.40p we eet 







iCl,l 


d2,i 



2 : 2,1 


dz,! 


K 


— 




1,1 


dn,l 


ai — 1x2 

0 




1 

0:2 — 0:3 


= 

0 

1 




0 


0 

1 


In-l 


(4.43) 


from which we conclude 


= {oii — 02 ) ^^ 2 , 1 , 

Xk,i = {ak-ak+i)~^ {dk+i,i-Xk-i,i), 2 < k < n - 1. (4.44) 


Comparing (14.4411 with (I4.14p . we see that xiq = Ti^i and that recursions defining 
Xk,i and Tfc^i in terms of Xk-i,i and Tfc-i^i, respectively, are identical. Therefore, 

Xk,i = '^k,i for k = l,...,n-l. (4.45) 

Making again use of formula (13.1011 for A we equate the top entries in (|4.41ll and get 


c^l,2 + Xi^i{/3i — P 2 ) — *^1,2 + rip(/3l — /32) — 0 

(the second equality holds since xiq = Ti^i, by (j4.45p ). which is the second condition 
in (j4.37p . Equating other corresponding entries in (j4.40p and taking into account 
(j4.45p . we get 



2 : 1,2 


^ 2,1 




^ 2,1 


' r 2 ,i ■ 

A„ 

Xn- 2,2 

= 

dn-l,l 

+ 

1,1 

(/3i-/?2) = 

dn—1,1 

+ 

Ln-l,! 


Xn—1,2 


dn,l 


^71,1 


dn,l 


^n,l 


W1-P2), 
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where is given by (I4.4.S|) . Taking into account the two-diagonal structure (j4.43p 
of An, we derive from the last equation 

xi,2 = {ai - a2)~^{d2,i + ri_i(/3i - (d^)), (4.46) 

Xk ,2 = (otk — afc+i) ^ {dk+ 1,1 + rfc+iq(/3i — /? 2 ) — rfc- 1 , 2 ) j 2 < k < n — 2. 

Comparing ()4.46l) with (14.911 . we see that xi ^2 = ri ^2 and that recursions defining 
Xk ,2 and Tk ,2 in terms of Xk- 1,2 and rfc_i^ 2 , respectively, are identical. Therefore, 

Xk ,2 = '^k ,2 for fc = l,...,n-2. (4.47) 

We now choose an integer £ (3 < £ < m) and assume that conditions ()4.37l) hold for 
all j = 1, ...,£— 1 and that Xkj = for all j < £ and k = 1 , ... ,n — j. Equating 
the top entries in (I4.42p (for j = £) we get 

0 = di^i -I- xi^£_i(/3^_i — I3() -I- xi^i-2 = di^i -|- ri^£_i(/3£_i — 151) + rp£_2 

which is the £-th condition in (I4.37p . Equating all other entries in the same equation 
gives 


^ 1 / 


d2,e 




1 

to 

1 

to 

_1 


= 




{fdg-i — (5g) + 


^n—l,£ 


1 






which implies (similarly to (I4.44p and (14.461) . 

xi,e = (w — ae+i)~^{d2,e + X 2 /-i{/ 5 i — /?2) + X2,e-2), 

Xk,£ = {oik — Ok+i) ^ {dk+i/ + Xk+i/-i{/5i-i — jdf) -I- Xk+i/-2 — Xk-i,e) 

iov k = 2,... ,n. At least for A: < n — we may use assumptions = Tkj to 
rewrite the above equalities (more precisely, the right hand side expressions in the 
above equalities) as 

xi,(. = {oig, — ag+i)~^ {d2^g + T2/-i{l3i — (52) + r2/-2), (4.48) 

Xk,e. = (ofc — Ofc+i) ^ {dk+i,t + ^k+i,i{f^i-i — fdg) + ^k+i/-2 — Xk-i,e) ■ 

We again compare (14.471) with (14.9p to see that xu = and that Xk/ and Tk^i 
are defined by the same recursion. Therefore Xk^g = ^k,t for ^ = 5,... ,n — £, and 
then we conclude by induction that equalities (I4.37D hold for j = 1,..., m and that 
Xk^j = Tfc j for all A: < n and j < m such that k + j < n. 

It remains to show that conditions (I4.37P are sufficient for the equation (14.6p to 
have a solution. Let us extend the spherical chain a = (ai,... ,an) to a spherical 
chain S = (ai,..., an, otn+i, ■ ■ ■, an+m)- We may let, for example, an+i = On for 
i = 1,..., m. We then let 

A = Ja = = 


A 0 
* * 


and 


A' = [ai6i^_ 


e in-tm _ 


A' 

* 


0 

* 
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to be the corresponding extensions of A and A' (see (13.81) ) and we arbitrarily extend 
the given matrix C to 


d 


c 

c 




J 2 =l,...,n+m 




(we may choose C" = 0). We then consider the matrix D' = CB — A'C € 
which is indeed an extension of the matrix (|4.7p : 


D' = dB- A'd 


C 

c 

CB 



D' 



'c 


a 


D 

D' 


[di,j 


J2=l,...,n+m ' 


We now use recursions (|4.9I) . ()4.10p to define the elements for all positive k < 
n + m and j < m such that k + j < n + m; see Remark 14.21 (part (1)). By Remark 
14.21 foart (3)) and by formula (14.91) . the elements (for k + j < n) are completely 
determined by the entries of the original matrices A, B and C. Since equalities 
(I4.37P hold (i.e., the elements Sj in (|4.1ip are equal to zero for j = 1,... , m, Lemma 
14.31 applies to the extended set {Lfc^- : k + j< n + m} and guarantees that equalities 
(j4.12p hold for all A: < n + m and j < m such that k + j < n + m. In particular, 
equalities (j4.12p hold for all 1 < A; < n and 1 < j < m. Furthermore, if we let 

r = 


€ 


(4.49) 


then (as it is readily seen from (14.11) 1 defined as in ()4.12l) is the (A,j)-entry in 
the matrix 

A = . = AT-FB-C. 

Equalities (14.121) for all 1 < A < n and 1 < j < m mean that A = 0, i.e., that F is a 
solution to the equation (14.61) . □ 


The last theorem suggested an algorithm for constructing a particular solution 
for a solvable singular Sylvester equation. We record it for the convenience of future 
reference. 


Algorithm 4.6. Given A and B of the form M where a = (oi,..., a„) and 
(3 = (/3i,... ,/3n) {m < n) are spherical chains from the same conjugacy class, and 
given C = G 

(1) Compute dij ioi i+ j <n by formula (14.81) . 

(2) Compute Fjj for f + j < n by formulas (14.9p . (|4.10p . 

(3) Verify equalities (14.111) for j = 1,... ,m. If they hold true, equation (14.61) 
has a solution. 

(4) Let an+i = «n and Cn+ij = 0 for i, j = 1,..., m. 

(5) Compute dij for n + 1 < i + j < n + m by formula ()4.8p . 

(6) Compute Fjj for n + 1 < f + j < n + m by formula ()4.9p . 

The matrix (I4.49P is a particular solution of the Sylvester equation (j4.6l) 

It is of particular interest to write necessary and sufficient conditions (I4.37p for 
solvability of a singular Sylvester equation (14.6p in terms of given matrices A, B 
and C. We were able to establish such formulas only in the case where A and B 
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are Jordan blocks (i.e., A and B are based on spherical chains of the form (I2.3p b 
Theorem lO below is the quaternion analog of a result of Ma m ; see ((LTD. 

Theorem 4.7. Let A and B be of the form (Sm with ai = a /3 = fl <i<n, 
1 < j < m < n). Given a matrix C = [c^j] G let 

di,i = ciq/3 - acpi, 

di,j = cij/3 - acij + cij-i {j > 2), 

dk,i + Ck—i^i (k ^ 2), 

dk,j T Cfc—ij T {k,j ^ 2). 

Sylvester equation dMD has a solution if and only (c.f. with 

^ Im(a) • d 2 e+i,j- 2 £ + ^ d 2 t,j- 2 t+i ■ Im(/3) = 0 for j = I,...,m. (4.50) 

2£<j 2l<j 


Proof. We will show that in the present setting, conditions (|4.37p are equivalent to 
(j4.50p . We fist observe that |Im(a)| = |Im(/3)| by characterization (j2.1|) . Therefore, 
(a — a)^ = (/? — /?)^ is a negative number and therefore 

(a — a)~‘^d{/3 — /3)^ = d for any d G H. (4.51) 

Making use of notation (jd.lip and formula (jd.Op for Tij-i (specified to the present 
particular case) we get 

{a-a)Sj ={a-a) [dij +rij_i(/3 - /3) + rij_ 2 ] 

={a — a){dij + rij_2) + (d2j-i + ^2,j-3 + ^2,j-2{f^ — P)) {P — P) 

={a - a)dij + d2,j-i{P - P) 

+ ^ 2 ,j-z{P — P) + {ot — a)^r 2 j -2 + (a — a)rij_ 2 , (4.52) 

where the last equality holds due to (I4.5ip . Making subsequent use of formula (|4.9I) 
for T 2 j -2 and then for Ts^-s, we conclude that the sum of the three rightmost terms 
in (I4.52P equals 

^ 2 ,j-z{P — P) + {a —a) [d3j-2 + r3j-4 + ^3,j-3{P — /3)] 

= {a — a) [d3,j-2 + r3j-4] + \dA,j-3 + + ^A,j-A{P ~ P)\ {P ~ P) 

= (a - a)d3,i-2 + dij-ziP “ P) 

+ r4j_5(/3 — P) + {a — a)^r4j_4 + (a — a)r3j_4. 

which being substituted into (14.5211 gives 

(a - a)Sj ={a - a){dij + d3,j-2) + id 2 ,j-i + d4j-3)(/3 - P) 

+ ^Aj-diP — P) + {a — a)^r 4 j _4 + (a — a)r 3 j_ 4 . 


(4.53) 
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The step which led us from (j4.52p to (I4.53|) can be repeated indefinitely. After I 
iterations we get 


l-\ i 

{a - a)Sj ={a - a) ^ d2fc+ij-2fc + ^ cl2fcj-2fc+i(/3 - (4.54) 

k=0 k=l 

+ ^2£,j-2£-iiP — /?) + (a — a)‘^T2e,j-2e + (a — a)T2£-ij-2e- 

Recall that and dkj are equal zero whenever one of the indices is nonpositive. 
If we choose i be greater than j/2 in ()4.53p and take into account only nonzero 
terms, we conclude that {a — a)Sj equals the expression on the left side of (|4.50n . 
Thus, conditions (|4.50p are equivalent (since a 7 ^ a to those in ()4.37p and applying 
Theorem 14.51 completes the proof. □ 


5. Singular case: parametrization of all solutions 

Since the equation (I4.6p is linear, all its solutions X are described by A = T + T, 
where Y is the general solution of the homogeneous equation 

AY -YB = 0. (5.1) 

we will use notation T>i(Y) = {Tfcj : k + j = i + 1} for the £-th counter-diagonal of 
the matrix Y = 

In this final part of the paper, we will present a parametrization of the solution set 
of the homogeneous equation (15.ip with free parameters fj,j G (j = 1, ..., m) 

from the planes C H defined as in (|4.4I) . The latter memberships mean, by 

Lemma l4.ll that 

an^J,j = fijPj for j = 1,.. ., m. (5.2) 

For any fixed tuple fj, = {fii,..., fim) of such parameters, we recursively define the 
entries Yj.j of the matrix Y = G the formulas 

Ykj =0 for all k + j<n, Ynp = Hi, (5-3) 

kfcj (Ofc-|-l Clfc) \^k+l,j—l{Pj Pj — l') ^k+l,j—2 T jj (^'4) 

{n — m + 1 < k < n — 1; n — k + l<j<n + m — k), 

i p \ 

“Mj+l ^ ^ I 2+1 Otn—i) j (2 ^ J ^ Tft). (^•^) 

e=i \i=l J 

For the sake of clarity, we display formula (15. 5 p in a less compact form 

knj+l —/^J+1 l) 

(^n—1 ^n— 2 ) ^n—lj • • • 

1 ) ■ ■ ■ ^n—j) ^n—j+lj- 
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/ ^ 

' 1 

\ 


(7 \ 


z+1 ^n—i) 

-1 


■ 

(5.6) 

V=^ 

J 


v=' / 



The construction of Y goes as follows: with (j5.8|l in hands, we use formula (j5.4p to 
compute all entries in the counter-diagonal T>n+iiY), it is not hard to see that 




In particular, for j = 2 we have 

hn—1,2 — {ctn Q^n—l) /I-l (/^2 /^l) (^•'^) 

and then we use (j5.5j) to compute 

hn,2 — Il2 (o^n CIn—l) Yn^l = /r2 (On On—l) ^1, (^•^) 

and then again (15.411 to compute the entries in T>n+2(h"). Once all counter-diagonals 
T>i{Y) are found for r = 2,..., n -|- j -|- 1 (and in particular, are specified for 
all k = we use (|5.5I1 to find Ynj+i and then (l5.4ll to compute the next 

counter-diagonals T)n+j+ 2 {Y). 

Theorem 5.1. Let fi = (fii,..., fim) satisfy (j5.2p . Then the matrix Y € 
constructed as in (|5.3|) -(j5.5|) is a solution to the equation dSU). Moreover, any 
solution Y to the equation dSH) arises in this way. 

Proof. The recursion formula (j5.4p is the homogeneous version of (j4.9[) (with all Ckj 
or, equivalently, all dkj equal zero). Hence, some conclusions will be obtained by 
applying the corresponding counter-parts from the previous section. In analogy to 
the non-homogeneous case, we introduce 


^k,j ■— OikYj^ j T Yk—\j Ykj—i 


(5.9) 


for k = 1,..., n and j = 1,..., m, and recall relations (14.331) holding for all A; > 3 
and j >2. We write them equivalently as 


^k-i,j — (ttfc+i — Oik)^k,j — Afc+ij_i(/3 ■ — (dj-i) -I- Afc+ij_2 


(5.10) 


and observe that iS.k+i,j -2 and L^k-i,j are the consequtive entries on the {k + j + 1)- 
th counter-diagonal of the matrix A = [A^j] whereas Afc_|_ij_i and lS.k+i^j -2 are 
the entries from the “previous” counter-diagonal T>k+j{^)- Thus, relations (j5.inp 
guarantee that once all entries in Pr(A) are zeros for some r > n and the bottom 
entry of the next counter-diagonal T>r+i(A) is zero, then all entries in Pr_|_i(A) 
are zeros. In other words, the following implication holds true: for any fixed r 
(n < r < n + m), 


^k,r-k = 0{r — m<k< n), 


A, 


n,r+l—n 


-n = 0 


Ak^r+i-k = 0 {r — m + 1 < k < n). (5.11) 


Furthermore, due to ()5.3I) . Akj = 0 for all {k,j) such that k + j < n. We next 
observe that due to (|5.2p . 


An,i = - mPi = 0. 


(5.12) 
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Making use of (j5.7p . (j4.13p and (|5.12D we get 

^n-1,2 = an-l^n.-l,2 “ '^71-1,2^2 

= an-l{an - Q;n-l)"Vl i/32 - I3l) - [an - Q!n_i)“Vl W 2 “ /3i)/32 

= (On - an-l)~^ (ttn/Ul “ (/32 “ A) = 0. 

Since = 0 for all k,j such that k + j = n and since A„,-i ,2 = 0, we recursively 

get from (15.1011 that Akj = 0 for all k,j such that k + j = n + 1 (i.e., that all the 
entries in VniA) are zeros). To use induction principle, let us assume that A^^i = 0 
for all k, i such that k + i <r for some r > n + 2 and show that = 0 also for all 
k,i such that k + i = r + 1. Let j := n — r. By the assumption, we have in particular, 
Afcj- = 0 for all A: = 1,..., n which implies, due to formula (15.911 . equalities 

Otk^k,j ^k,j/3j — ^k,j—l ^k—l,j for k — ‘2, . . . , TL. (5.13) 

Since Re(afc) = Re(/3j) (by characterization (l2.ip L the latter equalities can be writ¬ 
ten equivalently as 

OkYkj - Yk,jl3j = Yk-i^j - Ykj-i for k = 2,..., n. (5.14) 

We next compute, for a fixed k {2 < k < n — 1), 

hfcj'+i YkJ^ij 

i^k+i Q^fe) [hfc+ij(/3j /3j-|-i) Yk-^-lJ—l -|- Yk—lJ-^-l 
i^k+l CIfc)bfc+lj] 

(®fc+i *Tfc) Yk-^.lJYkj T 

C^A:+l(®fc+l CVk) Yk-\-lj (o;fc_|_i Oik) Yk-\-Xj 

T iok+\ Ok) \Yk—i,j+i ) (5.15) 

where for the first equality we used formula (|5.4I1 for Yk^j+i, the second equality 
holds due to (j5.14p (with k replaced by {k + 1)), and the last equality relies on 
(I4.13P (for a = Ok+i and /3 = Ok)- Writing formula (|5.14p for /c = n — 1 and then 
iterating it once and making use of (I4.13p . we get 

Yn—lJ+l YnJ —OniOn On—l) YnJ (<Tn On—l) Yn,j l3j-\-l 

T {On On—l) [®n—l(®n—1 On— 2 ) Yn—lj 

{On-l On— 2 ) Yn—ljl3j-\-l 

T(<an,—1 On— 2 ) [bn—3j + l b)i,_2j]] 

On/On On—l) Yn,j (<Tn On—l) Ynj/3j-\-i 

T On/On On—l) {On-l On— 2 ) Yn—lJ 

iOn On—l) {On-l On— 2 ) Yn—lJ/3j-^-l 

T {On On—l) {On-l On— 2 ) [bn—Sj'+l b^_2j] • 

We can continue iterating by invoking the formula (15.1411 for k = n — 3. After j 
iterations, we will get to the difference [Yn-j-ij+i — Yn-jj] of two entries from the 
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counter-diagonal T>n+iiY) which are zeros, by (15.3|) . Thus, j iterations of formula 
(j5.14j) (for k = n — 1,... ,n — j) results in 


rx 
3 I i 


hn—Ijj+l j ^ ^ i-(-l C^n—i 

J=1 \i=l 




Y„,_ 


n—i+l,j 


rx 

3 I e 


^ Yn-e+i,j I3j+1. (5.16) 


f=i \i=i 


We now combine latter formula with formulas (j5.9j) . (j5.5p for Anj+i and Ynj+i, 
respectively, to compute 


^nj+l — C^nYnJ+l T Ij+l Yn,j — Ctn/I-jr+l /^J+l/^i+1 — 

where the last equality is the consequence of ()5.2I) . Due to induction hypothesis 
and (I5.17p . me may apply implication (15.111) to conclude that Aj, j = 0 also for all 
k, i such that k + i = r + 1. By induction principle, Aj, ^ = 0 for all /c = 1,..., n, 
j = 1,... ,m and since A = [A^j] = AY — YB (by definition (I5.9p l. we conclude 
that y is a solution to (j5.1l) . 

It remains to show that any Y solving ()5.ip is necessarily of the form (I5.2l) - (j5.5l) . 
By Theorem 14.51 (part (2)), the entries Y^j (for k -\- j < n) are the same for any 
solution Y to the equation (I4.50p . Since the zero matrix is a solution, it then follows 
that Yfc j = 0 whenever k + j < n. Equating the entries from the bottom row in 
dST]), we get, on account of ()4.ip . 

OlnYn,! - Yn,iPi = 0, (5.18) 

OnYnJ - Ynjl3j -b W-IJ “ = 0 {j = 2,... ,m). (5.19) 

Equation (|5.18l) simply means that Yn^i = mui can be picked arbitrarily in 

the solution set of the scalar Sylvester equation (I5.18p . By (the proof of) Theorem 

14.51 any solution Y to the equation (|5.ip also satisfies 

AY -YB = 0, (5.20) 

where A and B are given in (lO.lOh and (j4.39p . Equating the (n — 1,2) entries in 
(j5.20p we get («„ — «n-i)yn-i ,2 — yn,i(/32 “ 3i) — 0) which is equivalent to (j5.7p 
and thus, verifies (15.4p for k = n — 1 and j = 2. Then we can apply formula (I5.14p 
for /c = n — 1 and j = 1 to get (we recall that ^ 1 - 2,2 = En-i,! = 0) 

En— 1,2 Yn^l = Otniptn CXn—l) (cin Cln—l)E^p/32, 

which, being substituted into equation (|5.19l) (for j = 1) leads us to 
0 =anYn,2 — Yn^2h + ^* 1 - 1,2 “ 

—C^n (Yn,2 T (cKn CIn—l) El^q) (Yn,2 T (®n CIn—l) En,l) 32- 
The latter equality tells us that the element 

/I'2 •— E)^ 2 T (<Tn CIn—l) Y^^l 

must satisfy equation (|5.2h for j = 2, which verifies formula (15.5h for j = 1- So far, 
we verified formulas ()5.4I) and (j5.5ll for the entries from the two leftmost columns 
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of Y. Let US assume the formulas hold true for all entries from the r > 2 leftmost 
columns and let us write explicitly conditions [AY — YB]k+i^r+i = 0 (see (|5.20p l for 
A; = 1,..., n — 1; in view of (|3.10l) and (|4.39l) . we have 

which being solved for Yfc,r+i verifies formulas (j5.4l] for j = r + 1 and k < n — 1. 
Since the formulas (j5.4|] and (j5.5p hold for j = r and all k < n by the assumption, 
equality (15.161) holds for j = r. Substituting this equality into equation (I5.19|) (for 
j = r + 1) gives 


0 — OinYn,r+l Yji^r+lf^r+l 4 “ Yn—l^r+1 Y^i^r 

r / ? 

— Oin. I Yn^r+1 4“ ^ ^ j J^(®n— j+1 CXn—i) | Y^— 


l+l,r 


g=l \i=l 

r\ 

1' t 


Yn,r+1 4“ ^ ^ I J^(<Tn— i+1 Ctn—i) j j. j 


£=1 \i=l 

We conclude that the element 


r\ 

I 


^r+1 — 


W,r+1 4- j JJ( 


^n—i+1 ^n—i 


Y„,_ 


n—£+l,r 


i=l \i=l 


must be in which verifies (I5.5p for j = r + 1. By induction principle, 

formulas p5.4p and (|5.5p hold for all j = 1,..., m. □ 


Corollary 5.2. Let A = Zn{oi) and B = ImiP) (n > m) be Jordan blocks based on 
the elements a ^ /3. A matrix Y = [Y^j] € satisfies AY = YB if and only if 

Ykj = 0 (2 < k + j < n) and Y^j = ^k+j-n {n + l<k + j<n + m), (5.21) 

where /Ui ,..., ytm G H are any elements from the plane (see (14.4p and ra;, 
i.e., 

am = fiil3 for i = l,... ,m. (5.22) 


Proof. It suffices to let Uk = a and fij = /3 in Theorem 15.11 and to show that 
formulas (|5.3I) ~ (I5.5I) amount to (|5.21l) in this particular setting. This is clearly true 
for /c + 1 < n and for Y^^i = p-i- For the rest, we will use equalities 

am = mP (i = l,...,m), (5.23) 

which follow from p5.22p since Re(Q!) = Re(/?), by (12.11) . Due to equalities (j5.22l) 
and (I5.23p . formula (15.6p takes the form 

Yn-jj+i = {a- a)“Vi(/S - py = m (j = 1,... , m - 1). 

By (15.81) . Yn ^2 is necessarily of the form W .^2 = /i 2 — (« — Ci)~^fii where m is any 
element in But since 

a{a — a)~^p,i = {a — a)~^a{a — a)~^p,il3, 
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i.e., since (a — G it follows that ¥^^2 can be chosen arbitrarily in 

In other words, we may let Yn ,2 = /^2 G ^a,i3- Since Yn-j,j+i = /^i for all 
j = 0,..., m — 1, the formula ()5.4p (for k + j = n + 2) now gives 

Yn-j+i,j+i = {a- a)"V2(/3 - = ^^2 {j = 2,... ,m - 1) 

verifying the Hankel structure of T>n+i{Y). Now we have by (15.5p . 

I^n,3 = fJ- 3 - (a- a)“V 2 - (a - 

and since the second and the third terms on the right belong to and /is can 
be chosen in Y.a,p arbitrarily, we may let 1^,3 = /is G Now we use formula 

(15.4p and take into account the Hankel structure of T>n+i(Y) to verify the Hankel 
structure of Vn+ 2 iY). Repeating this argument m times we arrive at the desired 
conclusion. □ 

Recall that all results in Sections 4 and 5 were obtained under assumption that 
n > m. The assumption is not restrictive in the following sense: if m > n, then 
one can apply the obtained results to the adjoint equation (|3.24p as in the proof of 
Theorem 13.91 i.e., to replace A, B and C by R*, A* and C*, respectively. We omit 
further details. 
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